Inference In Text Understanding
نویسنده
چکیده
The problem of deciding what was implied by a written text, of “reading between the lines’ ’ is the problem of inference. To extract proper inferences from a text requires a great deal of general knowledge on the part of the reader. Past approaches have often postulated an algorithm tuned to process a particular kind of knowledge structure (such as a script, or a plan). An alternative, unified approach is proposed. The algorithm recognizes six very general classes of inference, classes that are not dependent on individual knowledge structures, but instead rely on patterns of connectivity between concepts. The complexity has been effectively shifted from the algorithm to the knowledge base; new kinds of knowledge structures can be added without modifying the algorithm. The reader of a text is faced with a formidable task: recognizing the individual words of the text, deciding how they are structured into sentences, determining the explicit meaning of each sentence, and also making inferences about the likely implicit meaning of each sentence, and the implicit connections between sentences. An inference is defined to be any assertion which the reader comes to believe to be true as a result of reading the text, but which was not previously believed by the reader, and was not stated explicitly in the text. Note that inferences need not follow logically or necessarily from the text; the reader can jump to conclusions that seem likely but are not 100% certain. In the past, there have been a variety of programs that handled inferences at the sentential and inter-sentential level. However, there has been a tendency to create new algorithms every time a new knowledge structure is proposed. For example, from the Yale school we see one program, MARGIE, [Schank et al., 19731 that handled single-sentence inferences. Another program, SAM [Cullingford, 19781 was introduced to process stories referring to scripts, and yet another, PAM, [Wilensky, 19781 dealt with plan/goal interactions. But in going from one program to the next a new algorithm always replaced the old one; it was not possible to incorporate previous results except by re-implementing them in the new formalism. Even individual researchers have been prone to introduce a series of distinct systems. Thus, we see Charniak going from demon-based [Charniak, 19721 to frame-based This work was supported ia past by National Science Foundation grant IST-8208602 and by Defense Advanced Research Projects Agency contrad NWO39-84-C-0089. [Charniak, 19781 to marker-passer based [Charniak, 19863 systems. Granger has gone from a plan/goal based system [Granger, 19801 to a spreading activation model [Granger, Eiselt and Holbrook, 19841. Gne could say that the researchers gained experience, but the programs did not. Both these researchers ended up with systems that are similar to the one outlined here. I have implemented an inferencing algorithm in a program called FAUSTUS (Pact Activated Unified STory Understanding System). A preliminary version of this system was described in [Norvig, 19831, and a complete account is given in [Norvig, 19861. The program is designed to handle a variety of texts, and to handle new subject matter by adding new knowledge rather than by changing the algorithm or adding new inference rules. Thus, the algorithm must work at a very general level. The algorithm makes use of six inference classes which are described in terms of the primitives of this language. The algorithm itself can be broken into steps as follows: step 0: Clonstruct a kwwk ase defining general concepts like actions, locations, and physical objects, as well as specific concepts like bicycles and tax deductions. The same knowledge base is applied to all texts, whereas steps l-5 apply to an individual text. Step I: Construct a semantic representation of the next piece of the input text. Various conceptual analyzers (parsers) have been used for this, but the process will not be addressed in this paper. Occasionally the resulting representation is vague, and FAUSTUS resolves some ambiguities in the input using two kinds of non-marker-passing inferences. Step 2: Pass markers from each concept in the semantic representation of the input text to adjacent nodes, following along links in the semantic net. IMarkers start out with a given amount of marker energy, and are spread recursively through the network, spawning new markers with less energy, and stopping when the energy value hits zero. (Each of the primitive link types in KODIAK has an energy cost associated with it.) Each marker points back to the marker that spawned it, so we can always trace the marker path from a given marker back to the original concept that initiated marker passing. Step 3: Suggest Inferences based on marker collisions. When two or more markers are passed to the same concept, a marker collision is said to have occurred. Por each collision, From: AAAI-87 Proceedings. Copyright ©1987, AAAI (www.aaai.org). All rights reserved.
منابع مشابه
The Impact of Contextual Clue Selection on Inference
Linguistic information can be conveyed in the form of speech and written text, but it is the content of the message that is ultimately essential for higher-level processes in language comprehension, such as making inferences and associations between text information and knowledge about the world. Linguistically, inference is the shovel that allows receivers to dig meaning out from the text with...
متن کاملA Parallel-Process Model of On-Line Inference Processing
This paper presents a new model of on-line inference processes during text understanding. The model, called ATLAST, integrates inference processing at the lexical, syntactic, and pragmatic levels of understanding, and is consistent with the results of controlled psychological experiments. ATLAST interprets input text through the interaction of independent but communicating inference processes r...
متن کاملAN INTEGRATED FIS-QFD MODEL FOR EVALUATION OF INTERNET SERVICE PROVIDER
<span style="color: #000000; font-family: Tahoma, sans-serif; font-size: 13px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: auto; text-align: -webkit-left; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px; display: inline !important; float: none; ba...
متن کاملAligning Semantic Graphs for Textual Inference and Machine Reading
This paper presents our work on textual inference and situates it within the context of the larger goals of machine reading. The textual inference task is to determine if the meaning of one text can be inferred from the meaning of another and from background knowledge. Our system generates semantic graphs as a representation of the meaning of a text. This paper presents new results for aligning...
متن کاملAN INTEGRATED FIS-QFD MODEL FOR EVALUATION OF INTERNET SERVICE PROVIDER
<span style="color: #000000; font-family: Tahoma, sans-serif; font-size: 13px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: auto; text-align: -webkit-left; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px; display: inline !important; float: none; ba...
متن کاملLexical Inference Mechanisms for Text Understanding and Classification
This paper describes a framework for building story traces (compact global views of a narrative) and story projections (selections of key elements of a narrative) and their applications in text understanding and classification. Word and sense properties are extracted using the WordNet lexical database enhanced with Prolog inference rules and a number of lexical transformations. Inference rules ...
متن کامل